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METHODS OF MODIFYING THE PRODUCTION OF ISOPENTENYL 
PYROPHOSPHATE, DIMETHYLALLYL PYROPHOSPHATE AND/OR 

ISOPRENOIDS 

BACKGROUND OF T HE INVENTION 
5 Field of the Invention 

The present invention is directed to genes encoding deoxyxyluIose-5- 
phosphate synthase (dxps), glyceraldehyde-3-phosphate dehydrogenase (gapd), 
and the lytB gene product, as well as vectors containing the same and hosts 
transformed with said vectors. The present invention also provides methods for 
10 modifying the production of isopentenyi pyrophosphate (IPP) and/or dimethylallyl 
pyrophosphate (DMAPP) and/or an isoprenoid compound (i.e., a compound derived 
from IPP and/or DMAPP). Additionally, the present invention provides a method 
for screening for procaryotic and eukaryotic genes encoding enzymes that 
participate in the nonmevalonate pathway leading to IPP and DMAPP. 

Background of the Invention 
A plethora of chemical compounds are produced by what are collectively 
known as the pathways of Isoprenoid (a.k.a. terpene) biosynthesis. The one feature 
in common to the many isoprenoids (more than 20,000 have been identified in 
plants; Chappeil, 1995) is their biosynthesis from the central metabolites and 
building blocks for all isoprenoid compounds: the 5 carbon compounds isopentenyi 
pyrophosphate (IPP) and its allylic isomer dimethylallyl pyrophosphate (DMAPP). 
The interconversion of IPP and DMAPP is reversible, and is catalyzed by the 
enzyme IPP isomerase (IPI). Using the IPP and DMAPP building blocks in various 
combinations, a modular assembly process that produces compounds of 5, 10, 15, 
20 or more carbons (in multiples of 5) allows the biosynthesis of the basic skeletons 
for the many and various isoprenoids with a relatively small number of basic 
reaction steps. The C40 skeleton of plant carotenoid pigments, for instance, is 
assembled from two molecules of a C20 compound, geranylgeranyl pyrophosphate 
(GGPP), that is itself assembled from 3 units of IPP and 1 unit of DMAPP, and that 
also serves as a precursor for many other branches of the isoprenoid pathway in 
plants. 

Isoprenoids are formed from IPP and DMAPP in at least three different 

1 
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compartments of plants cells: the cytosol/endopiasmic reticulum, the mitochondria 
(and/or Golgi apparatus), and the plastids (McGarvey and Croteau, 1995). The 
source of IPP and DMAPP for isoprenoid biosynthesis in these different 
compartments has long been a matter of some controversy and debate (see Bach, 
5 1995; McGarvey and Croteau, 1995). The well-known "classical" or 
acetate/mevaionate route to IPP and DMAPP proceeds from acetyi-CoA via 3- 
hydroxy-3-methylglutaryl-CoA (HMG-CoA) and mevalonic acid (MVA). The critical, 
rate-determining and irreversible step in this route in animals is the reduction of 
HMG-CoA to produce MVA, catalyzed by the enzyme HMG-CoA reductase 

10 (HMGR; EC 1.1J.34; reviewed in Goldstein and Brown, 1990). This enzyme also 
appears to provide an important control point for substrate flow into certain 
isoprenoids in plants (Bach, 1986; Chappell et al., 1995; Schaller et ah, 1995; 
Stermer et aK, 1994). HMGR has been localized in the cytosol of plants, 
associated with membranes of the endoplasmic reticulum (reviewed in Bach, 1995). 

15 There is no convincing experimental evidence that this key enzyme also resides in 
the plastids or mitochondria of plants. An uptake by plastids of IPP to support 
isoprenoid biosynthesis in this organelle has been suggested in the past (Kreuz and 
Kleinig, 1984). However, it has been difficult to reconcile a cytosolic source of IPP 
for plastid isoprenoids with the effective inhibition of cytoplasmically-made 

20 isoprenoids (e.g. sterols) but not of plastid-made isoprenoids (e.g. carotenoids) or 
mitochondrial ones (quinones) by mevinolin, a specific inhibitor of HMG-CoA 
reductase (discussed in Bach, 1995 and McGarvey and Croteau, 1995). 
Furthermore, the demonstrated ability of isolated chloroplasts of Acetabularia 
(Moore and Sheperd, 1977) and spinach (Schulze-Seibert and Schultz, 1987) to 

25 synthesize carotenoids and other isoprenoids from ^^^002 necessarily implies a 
chloroplast pathway for IPP biosynthesis. Further confusion arose from reports that 
radiolabelled acetate is efficiently incorporated by isolated chloroplasts into fatty 
acids but not into carotenoids (see Kleinig, 1989; Liedvogei, 1986; Schulze-Seibert 
and Schultz, 1987), a puzzling observation that has been explained by invoking a 

30 metabolic "channeling" that somehow renders the newly-formed acetyl-CoA pool 
available for fatty acid biosynthesis but unavailable for IPP biosynthesis (Kleinig, 
1989). 
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An attractive explanation for these many puzzling and apparently 
contradictory observations is provided by the suggestion of an "alternative" or non- 
mevaionate pathway for the biosynthesis of IPP and/or DMAPP in the chloroplasts 
of plants (Lichtenthaler et aL, 1997) and in algae (Schwender et a)., 1996). 
5 Existence of a distinct non-mevalonate biochemical pathway to IPP and/or DMAPP, 
using pyruvate rather than acetate as a substrate, was posited for certain bacteria, 
including Escherichia coli, as early as 1981 (Pandian et at., 1981), but this 
alternative pathway in bacteria received little attention until relatively recently (Zhou 
and White, 1991; Rohmeret ai., 1993 and Horbach etal., 1993; see Bach, 1995 
10 for a critical discussion of this topic). It appears that many bacteria do not have the 
acetate/mevalonate pathway for biosynthesis of IPP. For instance, a BLAST 
search of the E. coli genome, the entire sequence for which has now been 
deposited in GenBank, does not identify any open reading frames with significant 
similarity to known bacterial, plant or mammalian HMG-CoA reductases. Nor can 

1 5 we discern an HMG-CoA reductase homologue in the genome sequence data base 
of the cyanobacterium Synechococystis 6803. Because ancestors of the 
cyanobacteria are the presumptive progenitors of plant chloroplasts, the non- 
mevalonate route to IPP in these photosynthetic procaryotes is likely to resemble 
that in plant chloroplasts. 

20 Recent reports indicate that plant and algal chloroplast isoprenoids, including 

carotenoids (Lichtenthaler et a!., 1997; Schwender et aL, 1996), and the medically- 
important isoprenoid paclitaxel (a.k.a. TAXOL®) (Eisenreich et al., 1996) are made 
from IPP supplied via a non-mevalonate pathway. Whether this pathway supplies 
IPP for carotenoids and other isoprenoids manufactured in the various 

25 nonphotosynthetic plant plastids (e.g. chromoplasts, leucoplasts, amyloptoasts) or 
in etioptasts or developing chloroplasts is an open question. Several recent reports 
describe a thiamin-dependent enzyme from E. co// (Sprenger et ai., 1997; Lois et 
al., 1998) and from peppermint (Lange et al., 1998) that converts pyruvate and 
glyceraldehyde-3-phosphate (GAP) to deoxyxylulose-5-phosphate (DXP). A 

30 homologue of this enzyme was earlier described for Arabidopsis thaliana (Mandel 
et al., 1996), but the function of the A. thaliana polypeptide had not been 
ascertained. 
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The evidence is compelling that DXP is a substrate in the pathway leading 
to (PP or DMAPP in plants, algae, and certain bacteria including £, colt (Arigoni et 
al., 1997). However, DXP also is a substrate leading to other essential compounds 
in these organisms, including thiamin and pyridoxal (see Lois et al., 1998). 
Therefore the biochemical reaction catalyzed by the DXP synthase enzyme is not 
dedicated or restricted to providing substrate for the pathway or pathways leading 
to IPP and/or DMAPP, and isoprenoids derived therefrom. Because the product 
of the reaction is shared with several other pathways, the DXP synthase enzyme 
would not seem to be an obvious candidate for a controlling step in the pathway 
leading to isoprenoids. An ability to modify isoprenoid production, enhancing 
production of desired compounds or reducing that of undesirable compounds, 
would be quite advantageous in many applications. 

Plausible routes to IPP from pyruvate and GAP via DXP have been 
suggested for plants and bacteria (Schwender et a/., 1996; Rohmer et af., 1993 and 
1996; see Bach, 1995). A gene encoding what may be the second enzyme in the 
pathway, DXP reductoisomerase, has recently been identified in Escherichia coll 
(Takahashi et al., 1998). The precise nature and sequence of subsequent 
biochemical reactions in the nonmevalonate pathway, and whether regulation of 
flux through this pathway occurs primarily at a single controlling step, as with HMG- 
CoA reductase for the mevalonate pathway, are so far unknown. We have taken 
a genetic approach to delineate the pathway leading to IPP and DMAPP in E, coll 
and plants. 

We earlier made the surprising observation that the reversible 
interconversion of IPP and DMAPP limits the accumulation of carotenoids in £. coH 
(an observation alluded to in Sun et al., 1996 and Sun et al., 1998). Others 
(Kajiwara et ai,, 1997) have made a similar observation. As a direct result of 
continuing to pursue the same empirical approach (using a colored isoprenoid 
compound to "report" on the amount of IPP and DMAPP available for isoprenoid 
biosynthesis) which revealed that the isomerization of IPP is a limiting step in 
isoprenoid biosynthesis (see U.S. Patent No. 5,744,341 ), we have now identified 
several other genes which can be used to modify IPP, DMAPP and/or isoprenoid 
production in host cells. 
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For two of the these genes the functions of the products, glyceraldehyde-3- 
phosphate dehydrogenase and DXP synthase, are well known. The role of the 
product of the third, the lytB gene, is still obscure. Because certain mutations in 
this gene in Escherichia coii confer an ability to tolerate elevated levels of penicillin 
(Harkness et a!., 1992; Gustafson et al., 1993) the lytB gene product would seem 
to play some role in or otherwise influence the biosynthesis of peptidoglycan. It has 
been suggested that the iytB gene product is not an enzyme directly involved in the 
pathway leading to peptidoglycan but rather exerts some control on the production 
of the global regulator molecule guanosine 3',5'-bisphosphate (ppGpp) (Gustafson 
et al., 1993; Rodionov and Ishiguro, 1995). The evidence in support of this 
suggestion is correlative and indirect and therefore by no means convincing. 
Because iytB Influences the accumulation of certain isoprenoids in E. coli (see 
below), because an analysis of the nearly twenty completed bactehal genomes 
reveals it to be one of the very few genes found only in those bacteria that also 
contain genes encoding DXP synthase and DXP reductoisomerase, and because 
we have isolated an apparently chloroplast-targeted IytB cDNA from a eucaryotic 
plant, we speculate that lytB may encode an enzyme that catalyzes one of the later 
reactions in the nonmevalonate pathway leading to IPP and DMAPP in certain 
bacteria and in plant chioroplasts. To avoid confusion, it is important to note that 
another gene, which is completely unrelated to the gene under consideration here, 
has unfortunately also been given the name lytB (see, e.g., Garcia et al., 1999). 

There remains a need for methods to enhance the production of desirable 
isoprenoids and reduce the accumulation of undesirable isoprenoids in plants, fungi 
and bactena that contain a nonmevalonate pathway. There also remains a need 
in the art for methods for screening for procaryotic and eukaryotic genes encoding 
enzymes of isoprenoid biosynthesis and metabolism. 

SUMMAR Y OF TH E INVE NT ION 
A subject of the present invention is an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
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85% identical to SEQ ID NO: 2. 

Another subject of the present invention is an Isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
5 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
10 sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. 

The invention also includes vectors which comprise any of the nucleic acid 
sequences listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of enhancing the 
i5 production of IPP, DMAPP and/or an isoprenoid compound in a host ceil, 
comprising inserting into the host ceil a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having DXP synthase, GAP 
dehydrogenase and/or LYTB activity, wherein the heterologous nucleic acid 
sequence is operabiy linked to a promoter; and expressing the heterologous nucleic 
20 acid sequence, thereby producing the protein. 

Yet another subject of the present invention is a method of modifying the 
production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising Inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies DXP synthase, GAP 
25 dehydrogenase and/or LYTB activity in the host cell, wherein the heterologous 
nucleic acid sequence is operabiy linked to a promoter; and expressing the 
heterologous nucleic acid sequence, thereby producing the protein. 

The present invention also includes a method of expressing, in a host cell, 
a heterologous nucleic acid sequence which encodes for a protein having DXP 
30 synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operabiy linked to a 

6 

BNSDOCID- <WO_^9Q5884QA^J_> 



wo 99/58649 



PCT/US99/0704I 



promoter; and expressing the heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous 
nucleic acid sequence which encodes for a protein which modifies DXP synthase, 
GAP dehydrogenase and/or LYTB activity in the host cell, the method comprising 
5 inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is to provide a method for screening 
for eukaryotic genes which encode enzymes involved in isoprenoid biosynthesis 
10 and metabolism. 

BRIEF DESCRIPTION OF THE DRAWINGS 
A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better 
understood by reference to the following detailed description when considered in 
15 connection with the accompanying drawings, wherein: 

Figure 1 is an illustration showing that IPP and DMAPP serve as the central 
metabolites leading to an immense variety of different isoprenoid compounds in 
plants, bacteria, and other organisms. The mevalonic acid pathway via acetyl-CoA 
in the cytosol of plants and in animals is well characterized. A second route from 

20 pyojvate and GAP, not yet elucidated, is thought to operate in the chloroplasts of 
plants and algae, in cyanobacteria, and in many bacteria. Later reaction steps in 
the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
because it has not been established which of these two is the initial product of this 

25 pathway. Abbreviations: G3P, glyceraldehyde-3-phosphate; G3PD, GAP 
dehydrogenase; DXPS, DXP synthase; IPP, isopentenyl pyrophosphate; IPl, IPP 
isomerase; DMAPP, dimethylallyl pyrophosphate; FPP. farnesyl pyrophosphate; 
GPP geranyl pyrophosphate; GGPP, geranylgeranyi pyrophosphate. 

Figure 2 illustrates a route to IPP/DMAPP from pyruvate and glyceraldehyde-3- 
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phosphate (GAP), which is thought to operate in the chloroplasts of plants and 
algae, in cyanobacteria, and in nnany bacteria including E. coli. Later reaction steps 
in the pathway remain to be deternnined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
5 because it has not been established which of these two is the initial product of this 
pathway. Enzymes of interest are shown in bold white text in a black box. 
Abbreviations: DMAPP, dimethylallyi pyrophosphate; DXPS, deoxyxyluiose-5- 
phosphate synthase; DXPR, deoxyxyiulose-5-phosphate reductoisomerase; FPS, 
farnesyl pyrophosphate synthase, GAPD, giyceraldehyde-3-phosphate 
10 dehydrogenase; GGPP, geranylgeranyl pyrophosphate; IPI, isopentenyl 
pyrophosphate isomerase; IPP, isopentenyl pyrophosphate. 

Figure 3 illustrates that the C^^ carotenoid phytoene is derived by a head-to-head 
condensation of two molecules of the C20 GGPP compound, which itself is 
assembled from 3 molecules of IPP and 1 molecule of DMAPP, Abbreviations: 
15 FPP, farnesyl pyrophosphate; GPP geranyi pyrophosphate; PPPP, prephytoene 
pyrophosphate. 

Figure 4 is a schematic illustration and restriction map of plasmid pAC-LYC 
(Cunningham et aL, 1994) which contains genes (crtE, crtB, and crti) of Erwinia 
herbicola encoding all of the enzymes required for production of the pink-colored 
20 isoprenoid pigment lycopene from the colorless IPP and DMAPP compounds. Cm, 
chloramphenicol resistance gene. 

Figure 5A is a cDNA sequence and Figure 5B is the predicted amino acid sequence 
of a putative DXP synthase isolated from a flower cDNA library of Tagetes erecta 
(SEQ ID NOS: 1 and 2). The cDNA is incorporated into the plasmid pMarDXPS. 

25 Figure 6A is a cDNA sequence and Figure 6B is the predicted amino acid sequence 
of a chloroplast isoform of GAP dehydrogenase isolated from Arabidopsis thafiana 
(SEQ ID NOS: 3 and 4). The cDNA is incorporated into the plasmid pAtG3PD. 
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Figure 7A is a cDNA sequence and Figure 7B is the predicted amino acid sequence 
of a LYTB protein isolated from an Adonis palaestina flower cDNA library (SEQ ID 
NOS; 5 and 6). The sequence Is of clone ipl3 except that the 14 bp at the N- 
terminus were obtained from the slightly longer cDNA clone Ipi18. The cDNA is 
incorporated into the plasmid pApLYTB. 

Figure 8 is an alignment of the predicted amino acid sequences of LYTB from 
Adonis palaestina, Synechocystis PCC6803 and E. coli. The N-terminal extension 
of the Adonis polypeptide, relative to that of Synechocystis PCC6803, is predicted 
by the program ChloroP (Emanuelsson et a!., 1999) to constitute a chioroplast 
transit peptide, serving to target the polypeptide to this organelle in plants. Black 
boxes with white letters are used where all three of the aligned residues are 
identical. Grey boxes with black letters are used where two of the three aligned 
residues are identical. Abbreviations: Ap, Adonis palaestina: Sy, Synechocystis 
PCC6803\ Ec, Escherichia coli. 

Figure 9 is a schematic representation of the mapping of an E. coli genomic 
fragment to ascertain which of the genes in this fragment will enhance or impair 
lycopene accumulation in E. coli. Deletion mapping of an E coli genomic fragment 
(the insert in plasmid pEc3.9 is essentially identical to GenBank U32768: 
4879. .8819) with genes encoding DXP synthase (orf620), FPP synthase (ispA) and 
the small subunit of exonuclease VII (xseS). Numbers to the right indicate relative 
lycopene accumulation per mL of liquid culture with plasmids in lycopene- 
accumulating E. co// strain TOP10. The insert in pEc3.9 is oriented in the forward 
direction in the multicopy plasmid vector pBluescnpt SK-. The EcoRl and Smal 
sites in the vector preceding the genomic fragment and a Kpn\ site following it were 
used, along with Ndel, Smal, and Sa/i sites in the genomic DNA, to construct the 
deletion subclones illustrated. Incomplete genes and open reading frames are not 
shown. 



DESCRIPTIO N OF THE PRFFERRED EMBODIMENTS 



The term "isoprenoid" is intended to mean any member of the class of 
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naturally occurring compounds whose carbon skeletons are composed, in part or 
entirely of isopentyl C5 units. Preferably, the carbon skeleton is of an essential oil, 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxet. 
A search of the public databases (GenBank) for genes encoding 
5 homologues of the E. coii lytB gene product unearthed such genes in twelve 
bacteria and in the cyanobacterium Synechocystis PCC6803, A recent publication 
(Potter et aL, 1998) discusses the distribution of this gene in various bacteria. Our 
description herein of a homologue in the plant Adonis palaestina reveals for the first 
time that this gene is present and expressed in a eucaryotic organism. A recently 

10 deposited genomic DNA sequence for the green plant Arabidopsis thatiana 
(GenBank accession number AL035521 } contains what appears to be a gene (the 
probable coding sequence is interrupted by several apparent introns) encoding 
LYTB in this organism. The predicted sequence of this Arabidopsis LYTB is 
somewhat uncertain (a comparison of the Adonis sequence with that listed given 

15 in AL035521 for Arabidopsis suggests that several of the exon-intron junctions 
predicted for this gene in the GenBank record are incorrect), but sequence identity 
in a comparison with the Adonis sequence is ca. 80% or more. Partial cDNA 
sequences in the data base of expressed sequence tags (dbEST) that predict 
peptides with sequence similarity to portions of the Adonis LYTB sequence indicate 

20 that homologues exist and mRNAs encoding LYTB are produced in several other 
plant species including rice (D45948), loblolly pine (AA556723) and soybean 
(AI437981). Both the Adonis and Arabidopsis predicted amino acid sequences are 
more than 60% identical to that predicted by the cyanobacterium Synechocystis 
PCC6803 gene, and the two plant and the cyanobacterial sequences are more than 

25 30% identical to the predicted E coli gene product. An alignment of the Adonis, 
Synechocystis PCC6803 and E, coli predicted amino acid sequences is shown in 
Figure 7. A number of regions and residues conserved in LYTB are indicated in 
this Figure. 

The term "LYTB activity" is intended to mean the ability of LYTB to affect the 
30 production of IPP, DMAPP and/or isoprenoids in a host ceil containing the lytB 
gene or DNA copy of the lytB mRNA. It has not yet been confirmed that the LYTB 
protein is, in fact, an enzyme. The precise role of the LYTB protein in affecting the 

10 

BNSOOCIO- <WO 99See49A1_L> 



wo 99/58649 



PCT/US99/0704I 



production of isoprenoids has not been established. However, whatever the 
mechanism of action of the lytB gene product, we intend to cover that mechanism 
of action by the term "LYTB activity". 

The present invention is directed to an isolated nucleic acid sequence which 
5 encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% Identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 2. Preferably, the nucleic acid sequence is at least 
90%, at least 95% or completely identical to SEQ ID NO: 1, or the nucleic acid 
10 sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
15 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. Preferably, the nucleic acid sequence is at 
least 90%, at least 95% or completely identical to SEQ ID NO: 3, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 4. 
20 A further subject of the present invention is an isolated nucleic acid 

sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. Preferably, the nucleic acid sequence is at least 
25 90%, at least 95% or completely identical to SEQ ID NO: 5, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 6. 

In each case, sequence similarity is measured using sequence analysis 
software, for example, the Sequence Analysis software package of the Genetics 
30 Computer Group (University of Wisconsin Biotechnology Center, 1710 University 
Avenue, Madison, Wisconsin 53705), MEGAIign (DNAStar, Inc.. 1228 S. Park St., 
Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 S. 
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Bascom Avenue. Suite 200, Campbell, California 95008). Such software matches 
similar sequences by assigning degrees of identity to various substitutions, 
deletions, and other modifications. Conservative (i.e. similar) substitutions typically 
include substitutions within the following groups: glycine and alanine; valine, 
5 isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; 
serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle. J. Moi Biol 157: 105-132 (1982)), or on the 
basis of the ability to assume similar polypeptide secondary structure (see Chou 

10 and Fasman, Adv. Enzymol. 47: 45-148 (1978)). If comparison is made between 
nucleotide sequences, preferably the length of comparison sequences is at least 
50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or 
at least 100 nucleotides. It is most preferred if comparison is made between the 
nucleic acid sequences encoding the protein coding regions necessary for protein 

15 activity. If comparison is made between amino acid sequences, preferably the 
length of comparison is at least 20 amino acids, more preferably at least 30 amino 
acids, at least 40 amino acids or at least 50 amino acids, ft is most preferred if 
comparison is made between the amino acid sequences in the protein coding 
regions necessary for protein activity. 

20 The present inventors have isolated eukaryotic genes encoding DXP 

synthase from Tagetes erecta (marigold), GAP dehydrogenase from Arabidopsis 
thaliana, and LYTB from Adonis palaestina. All were identified on the basis of an 
enhancement of lycopene accumulation in E, cofi. The £. coli DXP synthase was 
also identified in this same way. 

25 Suitable vectors according to the present invention comprise a gene 

encoding one or more of the above-identified enzymes involved in IPP, DMAPP 
and/or isoprenoid biosynthesis or metabolism, wherein the gene is operably linked 
to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et al., Molecular 

30 Cloning A La borator y Manual , Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1989; Ausubel et al., Curmnt Protocols in Molecular Biology . Greene 
Publishing and Wiley Interscience, New York, 1991). Suitable vectors for 
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eukaryotic expression in plants are described in Frey et al., Plant J. (1995) 8(5):693 
and Misawa et a(, 1994a. Suitable vectors for prokaryotic expression include 
pACYC184, pUC1 19, and pBR322 (available from New England BioLabs, Bevery, 
MA) and pTrcHis (Invitrogen) and pET28 (Novagen) and derivatives thereof. The 
5 vectors of the present invention can additionally contain regulatory elements such 
as promoters, repressors, selectable markers such as antibiotic resistance genes, 
etc., the construction of which is very well known in the art. 

One or more of the genes encoding the enzymes as described above, when 
cloned alone or in combination into a suitable expression vector, can be used to 
10 overexpress these enzymes in a plant expression system or to inhibit the 
expression of these enzymes. For example, a vector containing one or more of the 
genes of the invention may be used to increase the amount of isoprenoids in an 
organism and thereby alter the nutritional or commercial value or pharmacology of 
the organism, 

15 Therefore, the present invention includes a method of enhancing the 

production of IPP, DMAPP and/or an isoprenoid in a host cell, relative to an 
untransformed host ceil, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein 
having DXP synthase, GAP dehydrogenase and/or LYTB activity, wherein the 

20 heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention also includes a method of modifying the production of IPP, 
DMAPP and/or an isoprenoid in a host cell, the method comprising inserting into 
the host cell a vector comprising a heterologous nucleic acid sequence which 

25 encodes for a protein which modifies DXP synthase, GAP dehydrogenase and/or 
LYTB activity in the host cell, relative to an untransformed host cell, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention further includes a method of expressing, in a host cell, a 

30 heterologous nucleic acid sequence which encodes for a protein having DXP 
synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
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sequence, wherein the heteroiogous nucleic acid sequence is operably linked to a 
promoter, and expressing the heterologous nucleic acid sequence. 

The invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein which modifies 
5 DXP synthase, GAP dehydrogenase and/or LYTB activity in the host cell, relative 
to an untransformed host ceil, the method comprising inserting into the host cell a 
vector comprising the heteroiogous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

10 Preferably, the isoprenoid comprises a compound derived from at least one 

member selected from the group consisting of geranyi pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranyigeranyl pyrophosphate (GGPP). By the 
term "derived from", we mean that the isoprenoid contains at least one of the listed 
compounds as a building block. It is also preferable if the isoprenoid comprises at 

15 least one member selected from the group consisting of a diterpene, a carotenoid, 
an essential oil, a fragrance, an isoprene, a cytokinin, a njbber, a quinone. a sterol, 
a hopanoid, a triterpene, a steroid, a prenylated protein, a phytoalexin, a gibberellin, 
a tocopherol, a dolichol, a chlorophyll and a therapeutic compound. 

It is most preferred if the isoprenoid comprises at least one member selected 

20 from the group consisting of an essential oil, a fragrance, a rubber, a carotenoid 
and a therapeutic compound, such as paclitaxel. 

The heterologous nucleic acid sequence may originate from a eukaryotic or 
procaryotic ceil. By the term "originate from", we intend to mean that the sequence 
information for the heterologous nucleic acid came from the eukaryotic or the 

25 procaryotic cell. However, the specific nucleic acid itself does not have to be from 
the organism. The nucleic acid may come from the organism, or it may be 
synthetically produced using recombinant nucleic acid techniques known in the art. 

Preferably, the heterologous nucleic acid sequence comprises a nucleotide 
sequence for dxps, gapd and/or tytB. It is most preferred that the heterologous 

30 nucleic acid sequence comprises a nucleotide sequence which encodes a dxps, 
gapd and/or a lytB gene and is at least 85% identical, preferably at least 90%, at 
least 95% or completely identical, to SEQ ID NO: 1 , 3 and/or 5, respectively, or the 
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nucleic acid sequence encodes a protein which has an amino acid sequence which 
is at least 90%, at least 95% or completely identical to SEQ ID NO: 2, 4 and/or 6, 
respectively. Identity is determined as noted above. 

The term "modifying the production" In the methods of the invention means 
that the amount of target compounds produced (e.g., IPP, DMAPP and/or 
isoprenoids) can be enhanced or reduced, as compared to an untransformed host 
cell. Thus, in accordance with an embodiment of the present invention, the 
production or the biochemical activity of the target compounds (or the enzymes 
which catalyze their formation) may be reduced or inhibited by a number of different 
approaches available to those skilled in the art, including but not limited to such 
methodologies or approaches as anti-sense (e.g., Gray et a!., 1992), ribozymes 
(e.g., Wegener et al., 1994), co-suppression (e.g. Fray and Grierson, 1993), 
targeted disruption of the gene (e.g., Schaefer et al., 1997), intracellular antibodies 
(e.g., see Rondon and Marasco, 1997) or whatever other approaches rely on the 
knowledge or availability of the nucleic acid sequences of the invention, or the 
proteins encoded thereby. 

Host systems according to the present invention can comprise any organism 
that utilizes a nonmevalonate (i.e., via DXP) pathway for production of IPP and/or 
DMAPP. Organisms which produce Isoprenoids using IPP and/or DMAPP derived 
from a nonmevalonate pathway include plants, algae, certain bacteria, 
cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present Invention can be done using standard 
techniques. See, for example, Sambrook et al,, Molecular Cloning A Laboratory 
Marmal. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; Ausubel 
et al., Current P rotocols in Molecular Biology , Greene Publishing and Wiley 
Intersclence, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the 
nucleic acid sequences of the present invention. The incorporation of these 
sequences can allow the controlling of isoprenoid biosynthesis, content, or 
composition in the host cell. These transgenic systems can be constructed to 
incorporate sequences which allow for the overexpression of the various nucleic 
acid sequences of the present invention. Transgenic systems can also be 
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constructed which allow for the underexpression of the various nucleic acid 
sequences of the present invention. Such systems may contain anti-sense 
expression of the nucleic acid sequences of the present invention. Such anti-sense 
expression would result in the accumulation of the substrates of the enzyme 
5 encoded by the sense strand. 

Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for 
purposes of illustration only and are not intended to be limiting unless otherwise 
specified. 

10 EXAMPLE 1 

Isolation of plant cDNAs and bacterial genes that enhance carotenoid accumulation 
in Escherichia coli 

The plasmid pAC-LYC (Cunningham et al,, 1994 and 1996) contains genes 
encoding all of the enzymes required for the formation of lycopene from IPP and 

1 5 DMAPP (see Figure 3). Cells of E.coli containing the plasmid pAC-LYC accumulate 
the carotenoid lycopene and thereby form colonies on solid growth medium that are 
pink in color (Cunningham et al., 1994). When cDNA or genomic libraries of 
plants, cyanobacteria, or bacteria are Introduced into the iycopene-accumulating 
E. co// strain (the complete methodology is contained in Cunningham etai, 1994 

20 and 1 996), a rare few of the colonies formed after spreading of the cell culture onto 
solid growth medium are much deeper in color than is typical of the vast majority 
of the colonies. The enhancement of color and concomitant carotenoid pigment 
accumulation in E. coli is prima facie evidence for an enhancement of flux into and 
through the pathway or pathways leading to the immediate precursors of 

25 carotenoids and all other Isoprenoid compounds: IPP and its allylic isomer DMAPP. 
Genes encoding subsequent enzymes in the pathway leading to lycopene are 
already present in the multicopy plasmid pAC-LYC (see Figure 3). 

Among the cDNAs and genes isolated with this screening methodology are: 
Tagetes erecta (marigold) cDNAs encoding a homoiogue of DXP synthase (the 

30 DNA sequence of the longest is given in Figure 4), an Exoli genomic clone 
containing the DXP synthase (and no other complete open reading: a genomic 
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fragment from the Sph\ site which lies at base 5210 in the sequence listed under 
GenBank U32768 to the A/del site at base 8003 in this GenBank sequence was 
found to enhance carotenoid accumulation in E. coli. This region encompasses the 
open reading frame earlier referred to as f620, and now as the DXP synthase), 
5 Arabidopsis thaliana cDNAs encoding a chloropiast isoform (Shih et at., 1991 and 
1992) of the enzyme GAP dehydrogenase (the sequence of the longest of these 
isolated cDNAs is given in Figure 5),and an Adonis palaestina cDNA encoding a 
homologue of the £. coli lytB gene product (Figure 6). The enhancement of 
isoprenoid biosynthesis in E. coll (as Indicated by lycopene accumulation) when 

10 cDNAs or genes encoding DXP synthase or GAP dehydrogenase are introduced 
can be understood in the context of the biochemical pathway that has been 
postulated for production of IPP/DMAPP in E. coli (and in other bacteria, 
cyanobacteria and plants). However, at the date of the provisional filing of this 
application, there had been no indications that increased flux through the pathway 

15 to IPP/DMAPP might be obtained by increasing the expression of either of these 
enzymes. A publication after the priority date of this application (Marker and 
Bramley (1999)) shows a salutary effect on isoprenoid accumulation in E coli in 
which foreign DXP synthase genes are introduced. 

The mechanism of enhancement of isoprenoid production by introduction of 

20 cDNAs encoding LYTB is not yet known. The nonmevalonate pathway leading to 
IPP and/or DMAPP has not yet been elucidated and these cDNAs may encode an 
enzyme subsequent to DXP synthase in the pathway. Alternatively, the 
enhancement of isoprenoid accumulation may involve a mechanism less direct 
(e.g., as for GAP dehydrogenase, involvement in biochemical reactions that utilize 

25 or supply the substrates GAP and pyruvate, or by exerting a regulatory influence 
on isoprenoid pathways). 

EXAMPLE 2 

Arabidopsis thaliana^ Tagetes erecta, and A donifi palaestina cDNA Libraries 

A size-fractionated 2-3 kB cDNA library of A, thaliana in lambda ZAPII 
30 (Kieber et al., 1 993) was obtained from the Arabidopsis Biological Resource Center 
at The Ohio State University (stock number CD4-15). Other size fractionated 
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libraries were also obtained (stocl< numbers CD4-13, CD4-14, and CD4-16), The 
cDNA libraries for Tagetes erecta (marigold) and Adonis palaestina (pheasant's 
eye) were constructed for us by Stratagene using cDNAs isolated from flower 
tissues. An aliquot of each library was treated to cause a mass excision of the 
5 cDNAs and thereby produce a phagemid library according to the instructions 
provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of 
E. co// TOP10 F' (this strain contained the plasmtd pAC-LYC) by incubation of the 

1 0 phagemid with the £ coli cells for 1 5 min at 37°C. Cells had been grown overnight 
at 30°C in LB medium supplemented with 2% (w/v) maltose and 10 mM MgS04 
(final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on 
an Eppendorf microfuge (541 5C) for 10 min. The pellets were resuspended in 10 
mM MgSO^ to a volume equal to one-half that of the initial culture volume. 

15 Transformants were spread on large (150 mm diameter) LB agar petri plates 
containing antibiotics to provide for selection of cDNA clones (ampiciliin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10.000 colony forming 
units were spread on each plate. Petri plates were incubated at room temperature 
for 2 to 7 days to allow maximum color development. Plates were screened visually 

20 with the aid of an illuminated 3x magnifier and a iow power stage-dissecting 
microscope for the rare deep pink colonies that could be observed in the 
background of paler pink colonies. 

EXAMPLE 3 

FnhancfimRnf nf Carotenoid Accumulation in E. coli by cDNAs and genes encoding 
25 LYTB, DXP s ynthase. IPP isomerase and GAP DGhydrogenase Individuaiiy and in 

Combination. 

Attempts were made to maximize production of three very different 
carotenoids in E. co// engineered to accumulate these isoprenoid pigments. The 
specific carotenoids chosen were the linear and lipophilic pink compound 
30 lycopene, the bicyclic hydrocarbon p-carotene and the much more polar dihydroxy, 
bicyclic carotenoid zeaxanthin. DXP synthase, IPP isomerase, LYTB, and GAP 

18 
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dehydrogenase had been observed to enhance the pigmentation of E. co// colonies 
growing on solid agar mediunn. For a more quantitative appraisal of the influence 
of these genes and cDNAs on carotenoid accumulation, the amounts of carotenoid 
pigments in liquid cultures were examined with the results given in Figure 1. The 
5 marigold DXP synthase, an Arabidopsis IPP isomerase and the Adonis LYTB 
cDNAs gave rise to significant and substantial increases in the amount of 
carotenoids accumulated per volume of culture. A partially-sequenced Arabidopsis 
cDNA (GenBank AA605545) with sequence similarity to the Adonis lytB cDNA gave 
results similar to that of the Adonis cDNA (not shown). A synechocystis PCC6803 
1 0 lytB gene was less effective but gave a significant enhancement as well. See Table 
1. GAP dehydrogenase was slightly detrimental to pigment accumlation in liquid 
culture, in contrast to earlier observed enhancement for cultures grown on solid 
media. The influence of this gene on pigment accumulation may depend on the 
specific growth regimen, A similar result, enhancement for cultures on solid media 
but reduction in liquid culture, was also obtained for an E, co// genomic fragment 
containg DXP synthase (see Example 4 below and Figure 8). A combination of 
DXP synthase with IPP Isomerase was significantly more effective at enhancing 
pigment accumulation than was either of the individual cDNAs. 

EXAMPLE 4 

Enhancem ent and Reduction of Lycopene Accumulation in E. cai't containing 
multiple conies of aenes encoding DXP synttiase and FarnBsyl Pyrnp tin<iphafR 
svnthasR. rRF^pRntjyf^ly 

A number of the inserts in plasmids obtained from dark pink colonies 
selected in screens of the E. coii genomic library (see example 1) contained the 
gene encoding DXP synthase in this organism (see Sprenger et al., 1997; Lois et 
aL, 1998). Deletion mapping of one such E. coii genomic fragment indicated that 
the dark pink colony phenotype and enhanced carotenoid accumulation in liquid 
culture was conferred by a fragment containing the DXP synthase and no other 
complete open reading (Figure 8). A genomic fragment containing the ispA and 
xseB genes immediately upstream of dxps was, in contrast, quite inhibitory for 
carotenoid accumulation (Figure 8). This diminution in pigmentation results, we 
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surmise, from the activity of the farnesyl pyrophosphate (FPP) synthase encoded 
by ispA (Fujisaki et al.. 1990). The FPP synthase would be expected to compete 
with geranylgeranyl pyrophosphate (GGPP) synthase (Figure 1) for IPP and 
DMAPP and thereby deplete the substrate available for carotenoid biosynthesis. 

Table 1 illustrates the influence of cDNAs encoding DXP synthase, GAP 
dehydrogenase, LYTB, and IPP isomerase, individually and in combination, on the 
accumulation of lycopene, p-carotene, or zeaxanthin in three strains of E. coli 



engineered to produce these isoprenoid compounds. 



Table 1. Enhancement of carotenoid accumulation in lycopene, b- 
carotene, and zeaxanthin-accumulating strains of E. coli by introduction of 
plant cDNAs and cyanobacterial genes individually and in combination. 


Plasmid 


cDNA or 
Gene 
Product 


Lycopene 
% of control 


b-Carotene 
% of control 


Zeaxanthin 
% of control 


pBiuescript SK- 


Control (empty 
vector) 


(100) 


(100) 


(100) 


pAtipiTrc 


Arabidopsis 
thaliana IP! 


202 ± 12 


227 ±3 


223 ±4 


pTedxps 


Tagetes erecta 
DXPS 


203 ± 11 


193± 11 


228 ± 40 


pAdlytB 


Adonis 

paiaestina LYTB 


157 ±9 


180 + 3 


168 + 2 


p6803lytB 


Synechocystis 
FCC 6803 LYTB 


116± 11 


111 ± 1 


109 + 2 


pAtgapA 


Arabidopsis 
thaliana GAPD 


79 + 6 


nd 


79 ± 3 


Data were obtained with E. co// strain TOP10 containing pAC-LYC, pAC-BETA or 
pAC-ZEAX (Cunningham et al., 1994 and 1996, Sun et al., 1996). Carotenoid 
content (per volume of culture) was measured after 48-72 h growth in LB medium. 
Cultures were inoculated with freshly grown individual colonies from agar plates. 
Values are mean ± so for 4 to 13 individual cultures. The empty piasmid pBiuescript 
SK' (from Stratagene) served as the control and was used as the vector for all of the 
plant cDNAs and the Synechocystis lytB gene. The influence of GAPD on 
accumulation of b-carotene was not determined (nd). 
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We claim: 

1 . A method of enhancing the production of isopentenyi pyrophosphate (IPP), 
dimethylailyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 
5 inserting into the host cell a vector comprising a heterologous nucleic 

acid sequence which encodes for a protein having deoxyxytulose-5-phosphate 
(DXP) synthase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
10 producing the protein. 

2- The method of claim 1 , wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyi pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranyigeranyl pyrophosphate (GGPP). 

3. The method of claim 1 , wherein the isoprenoid comprises at least one 
15 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 

4. The method of claim 1 , wherein the isoprenoid comprises a carotenoid. 

5. The method of claim 1, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

20 6. The method of claim 5, wherein the nucleic acid sequence for dxps encodes 
an amino acid sequence which is completely identical to SEQ ID NO: 2. 

7. The method of claim 1 , wherein the host cell is selected from the group 
consisting of a bacterial ceil, an algal cell and a plant cell. 

8. The method of claim 1 , wherein the heterologous nucleic acid sequence 
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originates from a eukaryotic cell. 

9. The method of claim 1, wherein the heterologous nucleic acid sequence 
originates from a procaryotic ceil. 

10. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethyialiyi pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies deoxyxyluiose-5- 
phosphate (DXP) synthase enzyme activity in the host cell, relative to an 
untransformed host cell, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
producing the protein. 

1 1 . The method of claim 10, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyi pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

12. The method of claim 10, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

13. The method of claim 10, wherein the isoprenoid comprises a carotenoid. 

14. The method of claim 10, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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16- The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

1 7. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having deoxyxylulose-5-phosphate (DXP) synthase 

5 enzyme activity, the method comprising inserting into the host cell a vector 
comprising the heterologous nucleic acid sequence, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter, and expressing the 
heterologous nucleic acid sequence. 

18. The method of claim 17, wherein the heterologous nucleic acid sequence 
10 comprises a nucleic acid sequence for dxps. 

19. The method of claim 18, wherein the nucleic acid sequence for dxps 
encodes an amino acid sequence which is completely identical to SEQ ID NO: 2. 

20. The method of claim 17, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal ceil and a plant cell. 

15 21. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

22. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

23. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
20 which encodes for a protein which modifies deoxyxylulose-5-phosphate (DXP) 

synthase enzyme activity in the host cell, relative to an untransformed host cell, the 
method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably 
linked to a promoter, and expressing the heterologous nucleic acid sequence. 
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24. The method of claim 23, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

25. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

26. The method of claim 23, wherein the heterologous nucleic acid sequence 
onginates from a procaryotic cell. 

27. A method of enhancing the production of Isopentenyl pyrophosphate (IPP), 
dimethylailyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, wherein the heterologous nucleic acid sequence 
is operabiy linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

28. The method of claim 27, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

29. The method of claim 27, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

30. The method of claim 27, wherein the isoprenoid comprises a carotenoid. 

31 . The method of claim 27, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 
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32. The method of claim 31, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

33. The method of claim 27, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant ceil. 

5 34. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

35. The method of claim 27. wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

36. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
10 dimethyiallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein which modifies glyceraldehyde-3-phosphate 
(GAP) dehydrogenase enzyme activity in the host cell, relative to an untransformed 
15 host cell, wherein the heterologous nucleic acid sequence is operabty linked to a 
promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

37. The method of claim 36, wherein the isoprenoid is derived from at least one 
20 member selected from the group consisting of geranyl pyrophosphate (GPP), 

farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

38. The method of claim 36, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

25 39. The method of claim 36, wherein the isoprenoid comprises a carotenoid. 
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40. The method of claim 36, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

41. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 42. The method of claim 36. wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

43. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, the method comprising inserting into the host cell 

10 a vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

44. The method of claim 43, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

15 45. The method of claim 44, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

46. The method of claim 43, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

47. The method of claim 43, wherein the heterologous nucleic acid sequence 
20 originates from a eukaryotic cell. 

48. The method of claim 43, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

49. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
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which encodes for a protein which modifies glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity in the host cell, relative to an untransformed host 
cell, the method comprising inserting into the host cell a vector comprising the 
heterologous nucleic acid sequence, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter, and expressing the heterologous nucleic 
acid sequence. 

50. The method of claim 49, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

51. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic ceil. 

52. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

53. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

54. The method of claim 53, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

55. The method of claim 53, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 
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56. The method of claim 53, wherein the isoprenoid comprises a carotenoid. 

57. The method of claim 53, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB, 

58. The method of claim 57, wherein the nucleic acid sequence encodes an 
5 amino acid sequence which is completely identical to SEQ ID NO: 6. 

59. The method of claim 53, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

60. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

10 61. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

62. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

15 Inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein which modifies LYTB activity in the host cell, 
relative to an untransformed host ceil, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

20 protein. 

63. The method of claim 62, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

64. The method of claim 62, wherein the isoprenoid comprises at least one 
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member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

65. The method of claim 62, wherein the isoprenoid comprises a carotenoid. 

66. The method of claim 62, wherein the host cell is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

67. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

68. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a procaryottc cell. 

10 69. A method of expressing, in a host ceil, a heterologous nucleic acid sequence 
which encodes for a protein having LYTB activity, the method comprising inserting 
into the host cell a vector comphsing the heterologous nucleic acid sequence, 
wherein the heterologous nucleic acid sequence is operably linked to a promoter, 
and expressing the heterologous nucleic acid sequence. 

15 70. The method of claim 69, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

71. The method of claim 69, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 6. 

72. The method of claim 69, wherein the host cell is selected from the group 
20 consisting of a bacterial cell, an algal cell and a plant cell. 

73. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 



34 



BNSDOCID; <WO 995a649A1J_> 



wo 99/58649 



PCT/US99/07041 



74. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

75. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies LYTB activity in the host ceil, relative 

5 to an untransformed host cell, the method comphsing inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

76. The method of claim 75, wherein the host cell is selected from the group 
10 consisting of a bacterial cell, an algal cell and a plant cell. 

77. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

78. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 
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Figure 5A: SEQ ID NO: 1 
Tagetes erecta (marigold) cDNA for 
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DXP synthase 

accctctcgt gtgtttcctg aatcacacca 
ccttgaaggg cgggtccgtg cccatagctc 
agtttgctta acccgtcagc caatgcaatc 
gtctttggga accgtagctg tttcaaaaga 
aggacccgac aacaatggat aagactacat 
ggggataaac ccaaaactcc gatcttggat 
catgaaaaat ccttgcgtgg aggagctcgt 
gagaagaaac tgtatatacg gtaccaaaga 
agcttggggg ttgtitgaatt: aacagtttca 
tccagaagac aagatcatct gggacgttgg 
aaatactgac cggaagaagg ccgagaatga 
ggacttgcag gctcccccaa gagagacgag 
cgcaggccac agctctacra gcattcctgc 
gcagagactc gctaggaaag aacaaccatg 
ggagcgacga ccgcaggaca agcacacgag 
ccragac^c" aatc-catca tagtct:"Caaa 
ngcccacagc caczactgac ggccccgcgc 
cgacccc-ca caaggcctzca aacaagccaa 
agccgcaaag gaagtaacaa agcaatcagg 
cggcaaaaa- ggacrcnc^a gtgaaaggaa 
tccatgccrg aagagct-gg cccctatcat 
caacctrgaa gaccXitgrct atgtTittcga 
ctccgggacc agrcccagcc cacactgtta 
ccgcccgctg aagctgccgc agacaaaatg 
tactcaaacc ggcaaacaaa agaaaaacaa 
cccaatattt cgtgcactct ctggttgctg 
acagtcgcga ctcatgcggc catgggcgga 
ccaaaaagag ctrcctgcac gtcgccttga 
acgcaatcac ct:::t.gctgca ggtttagcaa 
cgtgctattc attcatcttt cctacaaaga 
tgacgttgat cttcaaaagc tccctgttag 
gtttggcagg agcagatggt ccaactcact 
Cucatggctt gttraccaaa catggtggtt 
agagctcatg aacatggtgg ccacagctgt 
gctgcttcag gtacccaaga ggaaatggaa 
aataacaaag gaaccttaat cgaggttgga 
gggaaataga gcggccttat cgggatatgg 
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FIGURE 5A (cont.) 
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t -crrgtaac 


atacatggcc 


gcgctgataa 


trgtcgacat 


2401 


aancrcgtcc 
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(marigold) cDNA for 



1 


MALCGALKGG 


FVPIAQNGYT 


SSSLLNPSAN 


AIMPSinCRKF 


LGIVAVSKEH 


51 


ATKEHEDLTT 


MDKTTSTTLK 


YSGDKPKTPI 


LDTIIIYPIHM 


KNLCVEELVK 


101 


LADELREEIV 


YTVSICrGGHL 


SSSLGWELT 


VSLHHVTNTP 


EDKIIWDVGH 


151 


QAYPHKXLTG 


RRSRMRTIRQ 


TFGLAGFPKR 


DESNHDAFGA 


GHSSTSISAG 


201 


LGMAVGRDL.L 


GKNNHVTAVT 


GDGAMTAGQA 


YEAMNNAGYL 


DSNLIIVLKD 


2S1 


NRQFSLPTAT 


IDGPAPPVGA 


LSRSLTRLQT 


SQKFRQLREA 


AKEVTKQLGD 


301 


ICrHEVAAKMD 


SLVKGMVGGQ 


GASMFEELGL 


YYVGPVDGHN 


LEDLVYVFDK 


351 


IKSMTAPGPV 


LVHIVTEKGK 


GYPPAdVAAD 


KMHGWKFDT 


QTGKQKKNKT 


401 


ICrLSYTQYFV 


DSLVAEAKED 


DKIVAIHAAT-I 


GGGTGLNTFQ 


KHFPARCFDV 


451 


GIAEQKAITF 


AAGLATEGI-K 


PFCAIYSSFL 


QRGYI3QWHD 


VDLQKXPVRF 


501 


AMDRAGLVGA 


DGPTKCGAFD 


TTFMACLPNM 


\ArMAPSC21AE 


LMNMVATAVA 


551 


IDDRPSCFRY 


PRGNGIGSIL 


PAiJNKGTLIF 


VGTGRVIKEG 


NRVALLGYGT 


601 


IVQSCLAASH 


^/LKKIGISVT 


VADARFCKPL 


DGNLIKQLAW 


EHEVLITVEE 


651 


GSIGGFSSHV 


SHFUVIxNGI^L 


DGKLKWRAMM 


LPDRYIZHGA 


QSDQISEAGL 


701 


SS KEIAATVL 


SLIGGSKZTL 


HALNV* 







Figure 5B: SEQ ID NO: 2 7/13 

PREDICTED AMINO ACID SEQUENCE of Tagetes erecta 

DXP synthase 

TRANSLATION from: 55 to: 2232 Length: 726 
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Figure 6A: SEQ ID NO: 3 g^^^ 

Arabidopsis thaliana cDNA for a chloropiasts isoform of glyceraldehyde-3- 
phosphate (GAP) dehydrogenase. 



1 


ctgtccccaa 


gggtctcacc 


gaactctcag 


gaccgcgaag 


ctcctctgct 


51 


tctcttccct 


tcggcaagaa 


actttcttcc 


gatgagctcg 


tttccatcgt 


101 


ctccttccag 


acttctgcaa 


tgggaagcag 


tggtggatac 


aggaaaggtg 


151 


tgactgaggc 


caagctcaag 


gtggccacta 


atggattcgg 


taggatcggg 


201 


aagaactccc 


tgagatgctg 


gcatggtcgc 


aaggactctc 


ctcttgatat 


251 


cattgccatt 


aatgacactg 


gtggcgtcaa 


gcaggcttcg 


cacctactta 


301 


aatacgactc 


taccctcgca 


acctctgacg 


ctgatgtcaa 


accttctgga 


351 


gagactgcaa 


tctctgttga 


tggaaagatc 


acccaagttg 


tctctaaccg 


401 


aaacccgtcc 


ctizczccztz 


ggaaggagcc 


aggaactgac 


attgtcatcg 


451 


aaggaaccgg 


agngr tcgzg 


gacagagaag 


gtgcagggaa 


acacactgaa 


501 


gctggtgcca 


agaacgctat 


cactactgcc 


ccaggcaaag 


gagacattcc 


551 


aactt:atgt.c 




atgcagatgc 


ttiacagtcat 


ga-gaaccta 


601 


tcaticagcaa 


tgcacc-rgc 


accaccaact 


gtcttgc- 


cctrgncaaa 


651 


gc"cr tgacc 


aga;aa-ccgg 


Caccataaag 


ggtacaacga 


cgaccaccca 


701 


crctcacacc 


gcrgaccaca 


ggcrgccaga 


cgcgagncac 


cgcgatctaa 


751 


cgagagcaag 


agcagcrgcr 


ttgaacattg 


tccctacruc 


tacaggagca 


801 


gctaaagcrg 


Ccrcrc" ct w-c 


gcrccctaac 


ctcaaaggaa 


aacrcaacgg 


851 


gancgctctc 


cgngtaccaa 


caccaaacgc 


atcagtggtt 


gatcccgttg 


901 


tacacgtctic 


aaacaagaca 


tttgccgagg 


aagtcaacgc 


tgcc-tcaga 


951 


gattccgcag 


agaaagagct 


taaaggcata 


ctcgacgtct 


gcgacgagcc 


1001 


actagtgr cc 


gt: cga" tcca 


gatgctcaga 


tttttcaacg 


accactgatt 


1051 


catcactcac 


Cai_^^w-i_ai_ — 


ggagacgata 


tggttaaggt 


gattgcttgg 


1101 


tatgataatg 


aatggggtra 


ctcacagaga 


gttgttgact 


tggccgacat 


1151 


tgt cgccaac 


aactggaagt 


gat-Ccatca 


aactttgctt 


catgcttttc 


1201 


catttttrcc 


tct:r.tct.crt 


ttctacatta 


tggttctcaa 


aataccggtg 


1251 


aagactgtat 


a 
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Figure 6B: SEQ ID NO: 4 g^-| 3 

Predicted amino acid sequence of AfBbidopsis thaliana cDNA for a chioroplasts 
isofonn of glyceraldehyde-3-phosphate (GAP) dehydrogenase. 



1 


VPKGFTEFSG 


LRSSSASLPF 


51 


TEAKLKVAIN 


GFGRIGRNFL 


101 


YDSTLGIFDA 


DVKPSGETAI 


ISl 


GTGVFVDREG 


AGKHIEAGAK 


201 


ISNASCTTNC 


lAPFVKVLDQ 


251 


RARAAALNIV 


PTSTGAAKAV 


301 


QVSKICrFAEE 


VNAAFRDSAE 


331 


SLTMVMGDDM 


VKVIAWYDNE 


401 


FFSLSLFYIM 


VLKISVKTV 



GKKLSSDEFV SIVSFQTSAM GSSGGYRKGV 
RCWHGRKDSP LDIIAINDTG GVKQASHLLK 
SVDGKIIQW SNRNPSLLPW KELGIDIVIE 
BCVIITAPGKG DIPTYWGVN ADAYSHDEPI 
KFGIIKGTMT TTHSYTGDQR LLDASHRDLR 
ALVLPNLKGK LNGIALRVPT PNVSWDLW 
KELKGILDVC DEPLVSVDFR CSDFSTTIDS 
WGYSQRWDL ADIVA^INWK* FHQTLFHVFP 
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Figure 7A: SEQ ID NO: 5 
Adonis palaestina cDNA 
Length: 1619 June 4, 1998 



-L 


ccttcgtctc 


tgcacttctt 


tcttccatgg 


cgacttctct. 


tcaatcctgc 


C 1 
D 1 


cgc t tctcca. 


ccccatccga 


tctctcct tc 


cctgaaaccc 


gctctcccac 


1 fM 

lU 1 


tcgcct ctac 


cgatccaaga 


aaccatt tt:c 


cgtccgttgt 


cacagtgaag 


TCI 


ga. c c 1 1 c a.gg 


cccctcctcc 


acggcagtgg 


aatccgaatc 


cgacgccaag 


Z U X 


^ ^ ^ 4^ 

C C T_ U u C CgC «_ 


acaacctitac 


aagaagcaag 


aactacaacc 


gaagaggtt-t 


1 C 1 

X 


cggccacaaa 


gatgagaccc 


ccgagct tac 


gaacagcgag 


tatacaagtg 


J U X 


^ ^ ^ ^ +~ ^ ^ 13 

acgccac caa 


gaagc tgaag 


gagaacggaa 


acgagt acag 


t tggggacct 


J b X 


gt tacagcga 


aattggccga 


gccgtiatggg 


ttctgtcggg 


gtgtggaaag 


4 U X 


agctgt tcag 


actgcctatg 


aagctagaaa 


acagtttccc 


gatgagaaga 


45X 


tggattac 


taatgagact 


atccacaatc 


caactgttaa 


caagagatta 


CAT 
3 01 


gaagagatgg 


aagr taaaca 


gaccccagtc 


ggggatggga 


agaaacattt 


551 


tgatgt cgc t 


gcaaagcgcg 


acQu-gcgac 


tzcgccggct 


cctggagctg 


601 


ccgtaagtga 


aacgcrgacr: 


tcgagcgaga 


aaaatgccca 


gatagccgac 


Obi 


acaac wtigcc 


cacgggcat 


caacgcgcgg 


aat; tccgt -g 


agaagcataa 


701 


gaagggagag 


tacacaccaa 


" ratrcacgg 


caaacacrcc 


cacgaggaaa 


751 


ctattigcaac 


cgcat w-z -t: 


gc'^igggaacr 


acaucatt-gu 


gaagaacacg 


a A "1 

8 01 


gatgaggcaa 


tgcacgrgcg 


cgac"aca"r 


cttrggaggcg 


agct:raacgg 


Q C 1 

o 5 1 


at ccagctca 


gacaaacagg 


cacr zttgga 


gaaattcaaa 


tacgcaac tc 


Q A 1 

y U 1 


ccgaaggat u 


tgacccagac 


acacatctr a 


ttiaaaacagg 


cartgcaaac 


OCT 


caaactacaa 


tgcr-caacgg 


acaaacagaa 


gatattggt:a' 


aacttictcga 


1 A A T 

X U U 1 


aaagacnatg 


acgcgcaagt 


acggagttga 


gaatattiaat 


gaccacttca 


TACT 

10 51 


taagcct taa 


cacaacacgc 


gacgctacac 


aagagagaca 


agacgccatg 


T T A 1 

1101 


ttitaagt tgg 


tagaggagaa 


agcggatctc 


atattagtag 


t rggtggatg 


1151 


gaattcaagt 


aacacctcac 


atctacaaga 


aatttctgag 


ttacctggca 


T A A 1 

1201 


ttccttct ta 


ctggattgac 


agtgagacga 


gaataggacc 


aggaaacaag 


1251 


actagccaca 


aaccgaacca 


tggtgagcta 


gttgagaccg 


aaaactggct 


1301 


cccagagggc 


cctgttacaa 


ttggtgtgac 


atcaggtgct 


tctaccccag 


1351 


ataaggctgt: 


tgaagatgcc 


cttgtiaaagg 


tgttcgacat 


caagcgcgaa 


1401 


caattgttgc 


agctagcata 


gagatcagtg 


ttctgcagag 


atcaaagaac 


1451 


caccactgtt 


caatatgtgc 


gattgcatat 


gtatcatgga 


gaacacaggg 


1501 


atatagaaac 


tgtaatacct 


tgttatgtta 


tactatacgg 


atgtccttta 


1551 


ccataatgac 


gtaaatcttt 


agttccatgc 


aacaatattt 


tcttgaatga 


1601 


aaaaaaaaaa 


aaaaaaaaa 
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Figure 7B: SEQ ID NO: 6 ^^/^3 

Predicted amino acid sequence of Adonis palaestina LYTB 

Length: 465 Translate 27-1421 



1 


MATSLQFCRF 


STPSDLSFPE 


SI 


VESEFDAKSF 


RHNLTRSKNY 


101 


GNEYSWGPVT 


VKLAESYGFC 


151 


NPTVNKRLEE 


MEVKEIPVGD 


201 


EKNVQIVDTT 


CPWVSKVWNS 


251 


KYIIVKNMDE 


AMYVCDYILG 


301 


LIKTGIANQT 


TMLKGETEDI 


351 


TQERQDAMFK 


LVEEKVDLIL 


401 


TRIGPGNKIS 


HKLNHGELVE 


451 


KVFDIKREQL 


LQLA* 



TRSSTRLYRS KKPFSVRCHS EGPSGSSSTA 
NRRGFGHKDE TLELMNSEYT SDVXKKLKEN 
WGVERAVQIA YEARKQFPDE KIWITNEIIH 
GKKHFDWAK GDWILPAFG AAVSEMLTLS 
VEKHKKGEYT SIIHGKYSHE ETIATASFAG 
GEIiNGSSSDK QALLEKFKYA ISEGFDPDTD 
GKLLEKTMMR KYGVENINDH FISFNTICDA 
WGGWNSSNT SHLQEISELR GIPSYWIDSE 
TENWLPEGPV TIGVTSGAST PDKAVEDALV 
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FIGURE 8 



♦ « * * 50 

Ap:MATSLQrCRrSr?SDLSF?rrRSSTRI-rRSKK?rSVRCKS£G?SGSSSTA 

* » ★ • 100 

Ap : VE SS r DAKSFRHNXr RS KK^-NTiRGrGHKDE TLE LM^^ 

Sy : MD TKAFKR SXH n-SDN-i-HRKGFG'HGEEVMGV:MNTiEi Q^HL2 Q£ I RQN 



Ap : GNS Y S WGPy TVKggE S Y 
Sy : N y R I. £ R G D VJC Xxf^?. A F 
Ec : "- -MQX1JW?Ik?R 




150 



Ap : j PTKN: 
Ec :BRY{Svr) Si 



jIUg^lEVNIflDyvNGEKD^ aRX'id 



# 200 
p VyflLl^BFgA A V S E M L r X S 
p VvHxjpH fB Alsy E E frj Q LiN 

I A I £Es'f sH hE^'Js q a V r n e a k 



Ap : E KNVQX VjJ 
Sy : DgECT.XV 

Ec : sSnx^rvF 




^ w^i(s R E K;-:iCK kx hit 



250 

^ xiLg^^.yjEg T-XaR S S FA G 
L IE H A Gg ?g V £ qQh - ' - - 



» » ♦ * 300 

AO : fCrirrVKKMD EAMYV^CDXXIiGgElNG 3 S S D K'Q AX.ITE'K'r K YA I 3 E GT-D^Pr TX 

Sy : T jggrV L N>1 AX'A Q K vjcXi yXiX H g GD R QE JXD Y3:AN A H:S A GFMD QD 

Ec : "* - 3N?Z GG YXVE,S ?D D VWK- TVKNE S - 

♦ » * * 350 



Ap :SIST'GZA:N 
Sy lEvRXTGVAN 
Ec : - -Ki's FM? 




EO X'GJ^XEgXMMRJKIYgVE N I^TDSrF riStT.NjftgD] 



MXICGE 
M^JGS S Eg- ._-^^^-u - - ^ >^ 

S V 3 52"S"D V i 3 A XRQ^' - ^ ^ I VgP R K 





dhm:f Kg vg z k v d l i X 
SE^'F ^ SxS E D X* s i M vi 
eBvraHaQ- q az vv 




•k « 

" S N Rg Ag LA- Q R MJ 



_ 400 
[ x:S S VH^BSG 

ikraflBBda 




K I SITKLN HGZXVZ T 
RVE.HKPLGKDLZVL 



XPE GPV'T I 
XP AGK IXV 
■21^ - KZ VKCy 




GASgP 

gasRf 
gasBp: 



450 

KAKED AI*V 

kvReevmk 

X 3-Bq^'VVA 



Ap rgVFDr-KRZ Q- LLQXA 

Sy :|CILAXTkE AQ?VI,Z lAG 

Ec:RLQQLGGGEAI?LEGREZNIVFZV?KZLRVDZREVD 



4 64 
406 
316 
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Plasmid 

pSK- 
pEc3.9 

pEc3.1 
pEc2.4 
pEc1.6 
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Accumulation 
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Sail 
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Sail 
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8819 
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